Separation of speech signals using iterative multi-pitch analysis and prediction
نویسندگان
چکیده
A model for multi-pitch analysis is extended into an iterative multi-pitch analysis and prediction (IMPAP) scheme. The method is efficient in finding harmonic complex tones, such as voiced speech signals, in a mixture of such signals and possible noise background. It can also be used to separate the signal into perceptually relevant speech components. The method may be used in applications ranging from speech transmission (enhancement) to recognition (noise and extra sound rejection).
منابع مشابه
RNN-BLSTM Based Multi-Pitch Estimation
Multi-pitch estimation is critical in many applications, including computational auditory scene analysis (CASA), speech enhancement/separation and mixed speech analysis; however, despite much effort, it remains a challenging problem. This paper uses the PEFAC algorithm to extract features and proposes the use of recurrent neural networks with bidirectional Long ShortTerm Memory (RNN-BLSTM) to m...
متن کاملObject-Based Sound Source Modeling for Musical Signals
This study presents a framework for audio and music processing which consists of an analysis and a synthesis path that are connected at three representational levels. Auditory signal analysis techniques include a multi-pitch analysis model, an eventdetector, and sinusoidal modeling that are combined in an iterative sound separation system. Techniques are presented for detection of perceptually ...
متن کاملJoint Pitch-Analysis Formant-Synthesis framework for CS recovery of speech
A joint analysis-synthesis framework is developed for the compressive sensing (CS) recovery of speech signals. The signal is assumed to be sparse in the residual domain with the linear prediction filter used as the sparse transformation. Importantly this transform is not known apriori, since estimating the predictor filter requires the knowledge of the signal. Two prediction filters, one comb f...
متن کاملCochannel speech separation using multi-pitch estimation and model based voiced sequential grouping
In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is ...
متن کاملHarmonic filtering for joint estimation of pitch and voiced source with single-microphone input
Standard correlation based methods are not effective in estimating pitch tracks of multiple speech sources from a single-microphone input In this paper, an adaptive harmonic filtering is proposed to jointly estimate the source signals and their corresponding fundamental frequencies. By exploiting the harmonic structure of voiced speech, pitch information of one source is extracted from the pitc...
متن کامل